Posts about AI Agents

Claude Plays Factorio

Jack Hopkins tests LLMs in his Factorio Learning Environment (FLE).


Is Reasoning Language?

Exploring the nature of reasoning in AI models and asking whether making LLMs express their thoughts out loud limits their potential.


Knowledge Search Improvements

Significant improvements to Ada's knowledge retrieval enhance customer support accuracy, thanks to innovative team collaboration and ML advances.


Anthropic Computer Use

Just tried out Anthropic's Computer Use demo in a Docker setup! It can control a virtual machine and run tasks like adding a knowledge base for our bots. Super impressive, but it did trip up on some commands and interactions. Excited to see where this tech goes!